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ABSTRACT OF THE DISCIiOSURE 



A DNA segment having a sequence encoding a protein 
associated with attaching and effacing activity of 
enterohemorrhagic E .coli , or an oligonucleotide fragment 
thereof is provided. The DNA segment permits selection of 
DNA and amino acid sequences unique to a protein 
associated with attaching and effacing activity of 
enterohemorrhagic E.coli . Novel DNA segments or proteins 
can thus be constructed which contain the unique DNA and 
amino acid sequences. The DNA segments or oligonucleotide 
fragments thereof may be use to detect the presence of a 
DNA segment having a sequence encoding a protein 
associated with attaching and effacing activity of 
enterohemorrhagic E.coli. Selected DNA segments or 
oligonucleotide fragments thereof may be used in methods 
for detecting pathogenic verotoxin-producing E.coli . 
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Titles Attaching and Effacing Protein of Enterohemorrhagic 



FIELD OF -PHE IMVENTIOM 

The present invention relates to a DNA segment having a 
sequence encoding a protein associated with attaching and 
effacing activity of enterohemorrhagic E.coli , or an 
oligonucleotide fragment thereof. The DNA segment of the 
invention can be used to produce a protein associated with 
attaching and effacing activity of enterohemorrhagic 
E .coli or a part thereof, by culturing a transformant host 
cell which includes a recombinant molecule comprising a 
DNA segment of the invention or an oligonucleotide 
fragment thereof and an expression control sequence 
operatively linked to the DNA segment. 

The DNA segment of the invention also permits selection of 
DNA and amino acid sequences unique to a protein 
associated with attaching and effacing activity of 
enterohemorrhagic E.coli . Novel DNA segments or proteins 
can thus be constructed which contain the unique DNA and 
amino acid sequences. The invention also relates to uses 
of the DNA segments encoding a protein associated with 
attaching and effacing activity of enterohemorrhagic 
E.coli . and fragments thereof, and uses of a protein 
associated with attaching and effacing activity of 
enterohemorrhagic E .coli or parts thereof. 

BACKGROIIMD OF THE IMVENTIOM 

Enteropathogenic Escherichia coli (EPEC) colonize the 
intestine of humans (Ulshen, M., and Rollo J. (1980) N. 
Engl. J. Med. 302, 99-102) and experimental animals, 
(Tzipori, S., Robbins -Brown, R.M., Gonis, G. , Hayes, J., 
Withers, M. , and McCartney, E. (1985) Gut 21, 570-578) 
producing a characteristic lesion which has been called 
attaching and effacing (AE) (Moon, H.W., Whipp, S.C., 
Argenzio, R.A., Levin, M.M. , and Gianella, R.A. 1983. 



Infect. Iiranun, 41, 1340-1351). The lesion consists of an 
extremely close (lOnm) approximation of the bacterial 
outer membrane to the intestinal epithelial cell membrane. 
The microvilli of the enterocyte are effaced and there is 
polymerization of actin in the cytoplasm subjacent to the 
area of bacterial attachment (Knutton, S., Baldwin, T. , 
Williams, P.H., and McNeish, A.S. (1989) Infect. Immun. 
57, 1290-1298). The AE lesion is also produced by 
enterohemorrhagic E.eoli (EHEC) of serogroups 026, 0111 
and 0157 (Tzipori, S., Wachsmuth, I., Smithers, J., and 
Jackson, C, (1988) Gastroenterology 94, 590-597; Tzipori, 
S., Wachsmuth, I.K., Chapman, C, Birner, R. , Brittingham, 
J., Jackson, C, and Hogg, J. (1986) J. Infect. Dis. 154, 
712-716) . 

Escherichia coll serotype 0157sH7 is the most important 
member of the group of E.eoli which produce verotoxins and 
is a major public health concern. Among the verotoxin- 
producing B .coli ^ serotype 0157 sH7 is most commonly 
associated with human disease. This serotype is a 
recognized cause of hemorrhagic colitis, occurring both 
sporadically and in major outbreaks, and is also a cause 
of the hemolytic uremic syndrome especially among children 
and the elderly. These organisms are probably acquired by 
humans from an animal source via contaminated meat and 
occasionally unpasteurized dairy products. Outbreaks of 
disease caused by 0157 jH7 have been associated with 
consumption of undercooked hamburgers and raw milk (Riley 
IiW, Remis RS, Helgerson SD, et al. N Engl J Med 1983; 
308s681-5; Borczyk AA, Karmali MA, Lior H, et al . 
(Letter). Lancet 1987; ls98; Duncan L, Mai V, Carter A, 
et al. Outbreak of gastrointestinal disease - Ontario. Can 
Dis Wkly Rep 1987; 13-2! 5-8). 

A method that would identify these organisms in animal and 
food samples would be very useful for screening infected 
herds and contaminated food. Presently, the detection of 
verotoxin and DNA probes for verotoxin do not distinguish 
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verotoxin-producing E.coll which are important causes of 
human disease such as 0157 sH? from those which, although 
present in animals, rarely if ever cause human disease. 

5 Jerse et al. ((1990) Proc. Natl. Acad. Sci. USA 87, 7839- 

7843) and Donnenberg et al. ((1990) Infect. Immun. 58, 
1565-1571) have identified a gene in the EPEC strain 
E2348/69 which was necessary but not sufficient for the 
formation of the AE lesion. This gene of approximately 

10 3kb, which was designated eae, encodes a 94 kilodalton 

outer membrane protein (Jerse, A.E. and Kaper, J.B. (1991) 
Abst. Ann. Am. Soc . Microbiol. B-112, p.44) . It was shown 
that E2348/69 mutants carrying transposon TnPhoA 
insertions in the eae gene were unable to produce the AE 

15 lesion when incubated with HEp-2 cells (Donnenberg M., 

Calderwood, S., Donohue-Rolf e. A., Keusch, G.T., and 
Kaper, J. (1990) Infect. Immun. 58, 1565-1571). A Ikb 
Stul-Sall fragment which encompasses the central one third 
of the EPEC eae gene was found to hybridize with DNA 

20 isolated from bacteria of classical EPEC serogroups as 

well as with DNA isolated from EHEC of serogroups 026 and 
0157 (Jerse, A., Yu, J., Tall, B., and Kaper, J. (1990) 
Proc. Natl. Acad. Sci. USA 87, 7839-7843). Although both 
EPEC and EHEC strains cause ultrastructurally similar 

25 lesions, as shown in gnotobiotic piglets (Moon, H.W., 

Whipp, S.C., Argenzio, R.A., Levin, M.M., and Gianella, 
R.A. 1983. Infect. Immun. 41, 1340--1351; Tzipori, S., 
Wachsmuth, I.K., Chapman, C, Birner, R., Brittingham, J., 
Jackson, C, and Hogg, J. (1986) J. Infect. Dis. 154, 712- 

30 715) and in tissue culture cells (Knutton, S., Baldwin, 

T., Williams, P.H. , and McNeish, A.S. (1989) Infect. 
Immun. 57, 1290-1298), the two types of infection can be 
differentiated by anatomic site of involvement, suggesting 
differences in adherence factors. 



SUMMARY OF THE IMVENTIOM 

The present Invention provides a purified and isolated DNA 
segment having a sequence which codes for a protein 
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associated with attaching and effacing activity of 
enterohemorrhagic E.coli . In a preferred embodiment, a 
purified and isolated DNA segment is provided having a 
sequence which codes for a protein associated with 

5 attaching and effacing activity of enterohemorrhagic 

E.coli having an amino acid sequence which has substantial 
homology with the amino acid sequence as shown in the 
Sequence Listing as SEQ ID N0:1. Most preferably, the 
purified and isolated DNA segment has a sequence having 

10 substantial sequence homology with the nucleotide sequence 

as shown in the Sequence Listing as SEQ ID NOsl. 

The invention also relates to a recombinant molecule 
adapted for transformation of a host cell comprising the 
15 DNA segment of the invention operatively linked to an 

expression control sequence. A transformant host cell 
including a recombinant molecule of the invention is also 
provided. Still further, this invention provides plasmids 
which comprise the recombinant molecules of the invention. 

20 

The present invention further relates to an avirulent 
strain of an enterohemorrhagic E.coli comprising an 
avirulent bacterial carrier strain transformed with a 
recombinant molecule of the invention, and a vaccine 
25 composition comprising a bacterial carrier strain 

transformed with a recombinant molecule of the invention. 

The invention also provides a method of preparing a 
protein associated with attaching and effacing activity of 

30 enterohemorrhagic E . coll , utilizing a DNA segment of the 

invention. The method comprises culturing a transformant 
host cell including a recombinant molecule comprising a 
DNA segment of the invention and an expression control 
sequence operatively linked to the DNA segment, in a 

35 suitable medium until the protein is formed and thereafter 

isolating the protein. 



207871.6 



The invention still further, provides, a substantially pure 
protein associated with attaching and effacing activity of 
enterohemorrhagic E .coli . In a preferred embodiment the 
protein has an amino acid sequence which has substantial 
homology with the amino acid sequence as shown in the 
Sequence Listing as SEQ ID NOil. 

The invention also relates to antibodies specific for an 
epitope of a protein of the invention, preferably a 
monoclonal antibody, and a method for preparing the 
antibodies . 

A diagnostic kit for detecting enterohemorrhagic E.coli . 
preferably E .coli 0157 in a sample comprising a monoclonal 
antibody of the invention and directions for its use is 
also provided. 

The invention also contemplates an immunoassay for the 
detection of enterohemorrhagic E .coli . preferably E .coli 
0157, comprising contacting a sample suspected of 
containing enterohemorrhagic E.coli with a monoclonal 
antibody of the invention to form an immune complex; and 
determining the presence of the immune complex. 

The DNA segments of the invention or oligonucleotide 
fragments of the DKA segments, allow those skilled in the 
art to construct nucleotide probes for use in the 
detection of nucleotide sequences in samples such as 
biological or food samples. The nucleotide probes may be 
used to detect nucleotide sequences that encode proteins 
related to or analogous to the attaching and effacing 
protein of the invention. 

Accordingly, the invention provides a method for detecting 
the presence of a DNA segment having a sequence encoding 
a protein related to or analogous to a protein associated 
with attaching and effacing activity of enterohemorrhagic 
E.coli or an oligonucleotide fragment thereof in a sample. 
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comprising contacting the sample with a nucleotide probe 
capable of hybridizing with the DNA segment or an 
oligonucleotide fragment, thereof, to form a hybridization 
product, under conditions which permit the formation of 
5 the hybridization product, and assaying for the 

hybridization product. 

The invention further provides a kit for detecting the 
presence of a DNA segment having a sequence encoding a 

10 protein related to or analogous to a protein associated 

with attaching and effacing activity of enterohemorrhagic 
E .coli or an oligonucleotide fragment thereof in a sample, 
comprising a nucleotide probe capable of hybridizing with 
the DNA segment or an oligonucleotide fragment thereof, 

15 reagents required for hybridization of the nucleotide 

probe with the DNA segment or an oligonucleotide fragment 
thereof, and directions for its use. 

The DNA segment of the invention also permits the 
20 identification and isolation, or synthesis, of nucleotide 

sequences which may be used as primers to amplify a DNA 
segment of the invention or an oligonucleotide fragment 
thereof , for example in the polymerase chain reaction 
(PGR). 

25 

Accordingly, the invention relates to a method of 
determining the presence of a DNA segment having a 
sequence encoding a protein associated with attaching and 
effacing activity of enterohemorrhagic E.coli or a 

30 predetermined oligonucleotide fragment thereof in a 

sample, comprising treating the sample with primers which 
are capable of amplifying the DNA segment or the 
predetermined oligonucleotide fragment thereof, in a 
polymerase chain reaction to form amplified sequences, 

35 under conditions which permit the formation of amplified 

sequences, and, assaying for amplified sequences. 
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The invention further relates to a kit for determining the 
presence of a DNA segment having a sequence encoding a 
protein associated with attaching and effacing activity of 
enterohemorrhagic E.coli or a predetermined 

5 oligonucleotide fragment thereof in a sample, comprising 

primers which are capable of amplifying the DNA segment or 
the predetermined oligonucleotide fragment thereof in a 
polymerase chain reaction to form amplified sequences, 
means for assaying the amplified sequences, and directions 

10 for its use. 

The present inventors have designed specific nucleotide 
sequences for detecting pathogenic verotoxin-producing 
B .coli strains . The present invention therefore further 

15 provides a purified and isolated nucleotide sequence 

comprising the sequence shown in the Sequence Listing as 
SEQ ID N0s2j the sequence shown in the Sequence Listing as 
SEQ ID N0s3; the sequence shown in the Sequence Listing as 
SEQ ID N0s4, the sequence shown in the Sequence Listing as 

20 SEQ ID NO: 5; the sequence shown in the Sequence Listing as 

SEQ ID NO; 5; the sequence shown in the Sequence Listing as 
SEQ ID NOs?; the sequence shown in the Sequence Listing as 
SEQ ID NO: 8; the sequence shown in the Sequence Listing as 
SEQ lb NO: 9; a sequence having substantial homology 

25 thereto, or a fragment of the nucleotide sequence. 

Accordingly, the present invention relates to a method for 
detecting verotoxin-producing E.coli of the serogroups 05, 
026, 0103, 0111, 0118, 0145, and 0157 in a sample 

30 comprising contacting the sample with a Sall-StuI fragment 

comprising the sequence as shown in the Sequence Listing 
as SEQ ID NO: 2 and the sequence as shown in the Sequence 
Listing as SEQ ID NO: 3, under conditions which permit the 
fragment to hybridize with a complementary sequence in the 

35 sample to form a hybridization product, and assaying for 

the hybridization product. 
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The invention also contemplates a method for detecting 
vero toxin-producing E.coli serogroup 0157 in a sample 
comprising contacting the sample with a 0.5 kb fragment 
comprising the sequence as shown in the Sequence Listing 
5 as SEQ ID NO; 4 and the sequence as shown in the Sequence 

Listing as SEQ ID N0s5, under conditions which permit the 
fragment to hybridize with a complementary sequence in the 
sample to form a hybridization product, and assaying for 
the hybridization product. 

10 

A method for detecting pathogenic verotoxin-producing 
E.coli strains of the serogroup 0157 in a sample is also 
provided comprising treating the sample with the 
nucleotide sequence PIEH shown in the Sequence Listing as 

15 SEQ ID N0s5 and the nucleotide sequence P2EH5 shown in the 

Sequence Listing as SEQ ID N0s7 in the polymerase chain 
reaction, under conditions which permit the formation of 
amplified sequences which terminate with the nucleic acid 
sequence of one of the primers and the complementary 

20 nucleic acid sequence of the other of the primers, and, 

assaying for the amplified sequences . 

The invention further provides a kit for detecting 
verotoxin-producing E.coli of the serogroups 05, 026, 

25 0103, 0111, 0118, 0145, and 0157 in a sample comprising a 

Sall-StuI fragment comprising the sequence as shown in the 
Sequence Listing as SEQ ID NO: 2 and the sequence as shown 
in the Sequence Listing as SEQ ID NOsB, reagents required 
for hybridization of the fragment with a DNA segment 

30 encoding the attaching and effacing protein of 

enterohemorrhagic E.coli or an oligonucleotide fragment 
thereof, and directions for its use. 

The invention also relates to a kit for detecting 
35 verotoxin-producing E.coli of the serogroup 0157 in a 

sample is comprising a 0.5 kb fragment (A3-B2) comprising 
the sequence as shown in the Sequence Listing as SEQ ID 
NO 5 4 and the sequence as shown in the Sequence Listing as 
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SEQ ID NO: 5, reagents required for hybridization of the 
fragment with a DNA segment encoding the attaching and 
effacing protein of enterohemorrhagic E.coli or an 
oligonucleotide fragment thereof, and directions for its 
use. 

The invention further relates to a kit for detecting 
pathogenic verotoxin-producing E.coli strains of the 
serogroup 0157 in a sample comprising the primers PIEH as 
shown in the Sequence Listing as SEQ ID N0s5 and P2EH as 
shown in the Sequence Listing as SEQ ID NOs?, 
respectively, reagents required for the amplification in 
a PGR reaction of sequences which terminate with the 
nucleic acid sequence of one of the primers and the 
complementary nucleic acid sequence of the other of the 
primers, and directions for its use. 

DESCRIPTION QF THE DRAWINGS 

The invention will now be described in relation to the 
drawings in which : 

Figure 1 shows the DNA sequence of the EHEC eae gene and 
the deduced protein sequence starting at the start codon 
at 206 bp; 

Figure 2 shows the homology between the deduced protein 
sequences of the EPEC and EHEC eae genes f 

Figure 3 shows the amino acid sequence alignment of the 
predicted EPEC and EHEC proteins; 

Figure 4 shows the results of colony blot hybridizations 
of various 0 serogroups using A) a central (C1-C2) 1 kb 
fragment and B) a 3' {A3-B2) 0.5 fragment of the 0157 :H7 
eae gene; 
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Figure 5 shows the results of colony blot hybridizations 
of various O serogroups using a 0.45 kb fragment (PIEH- 
P2EH) of the 0157 5 H7 eae gene; 

Figure 6 shows the primer locations on the 3 ' region of 
EHEC eae gene; 

Figure 7 shows the amplification products of VTEC and EPEC 
using specific primer sets A3-B2 (0.5 kb) and P1EH-P2EH 
(0.45 kb); 

Figure 8 shows the results of colony blot hybridizations 
of serogroups 0157 (VT+) , 0157 (VT-) , 05, 026, 0111, 0113, 
0121, and 0127 with [a^^pj ^^^ip end-labeled with P2EH 
oligonucleotide; and 

Figure 9 shows the alignment of the deduced amino acid 
sequences at the 3' end ofs 0157: EHEC eae 0157 :H7 
protein; 0111s EHEC eae 0111 :H8 protein; EPEC: EPEC eae 
0127 :H6 protein; and INV : Y. pseudotuberculosis invasin. 

DETAILED DESCRIPTION OF THE INVENTION 

The present inventors have identified a chromosomal eae 
gene associated with attaching and effacing activity in 
enterohemorrhagic E .coli (EHEC) . The EHEC eae gene was 
identified from EHEC serotype 0157 :H7 strain CL-8. The 
entire predicted structural gene was sequenced including 
205 bp upstream and 118 bp downstream. Figure 1 and the 
sequence shown in the Sequence Listing as SEQ ID NOsl, 
show the DNA sequence of the EHEC eae gene and the deduced 
protein sequence starting at the start codon at 206 bp. 
Comparison of nucleotide sequence similarity between the 
EHEC and EPEC eae genes shows 97% homology in the first 
2200 bp and only 59% homology in the last 800 bp (Figure 
2). The amino acid sequence alignment of the predicted 
EPEC and EHEC proteins is shown in Figure 3 . The N- 
terminal amino acids are highly conserved for the first 
685 amino acids while for the C-terminal amino acids, only 
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short stretches can be aligned. There is conservation of 
cysteine residues at positions 859 and 933 of the 
alignment. 

5 Both the EHEC and EPEC sequences show similarity to the 

Yersinia pseudotuberculosis invasin gene (Isberg, R.R. et 
al., (1987) Cell 50, p. 769) with greatest divergence at 
the C-terminus. The C-terminal end of the Yersinia inv 
gene is associated with receptor binding (Leong et al, 

10 EMBO J 1990; 6s 1979-89) and it is possible that the same 

applies to the eae gene products. 

The inventors also probed vero toxin-producing E .coli 
(VTEC) (251 strains) of multiple 0-serogroups with a 

15 highly conserved Ikb Stu l-Sall central fragment from the 

EHEC eae gene using colony hybridization. Only the 
following VTEC serogroups (125 strains) were found to be 
positives 05, 025, O103, 0111, 0118, 0145, and 0157. 
Using the polymerase chain reaction (PCR) a less conserved 

20 0.5kb probe was generated from the 3' end of the EHEC eae 

gene. Among the 251 VTEC strains tested using colony 
hybridization, only those strains belonging to the 0157 
serogroup were positive. Additionally, the 3 'end of eae 
genes of VTEC strains appear to have unique nucleotide 

25 sequences as determined on selected 0-serogroups. These 

0-serogroups initially hybridized with the highly 
conserved central regions of the eae genes . 

Therefore, the present inventors have shown that the 
30 central 1 kb fragment of the EHEC eae gene is homologous 

in a variety of O serogroups of verotoxin-producing E .coli 
(VTEC) strains but that there is diversity at the C- 
terminal end. The heterogeneity at the 3' end of eae genes 
allows for specific detection of clinically relevant VTEC 
35 strains and may be associated with differences in receptor 

binding . 
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Accordingly, the present invention provides a purified and 
isolated DNA segment having a sequence which codes for a 
protein associated with attaching and effacing activity of 
enterohemorrhagic E .coli . 

5 

In a preferred embodiment, a purified and isolated DNA 
segment is provided having a sequence which codes for a 
protein associated with attaching and effacing activity of 
enterohemorrhagic E .coli having an amino acid sequence 

10 which has substantial homology with the amino acid 

sequence as shown in the Sequence Listing as SEQ ID N0:1. 
Most preferably, the purified and isolated DNA segment has 
a sequence having substantial sequence homology with the 
nucleotide sequence as shown in the Sequence Listing as 

15 SEQ ID NOsl. 

DNA segments of the present invention encoding a protein 
associated with attaching and effacing activity of EHEC or 
related or analogous sequences may be isolated and 

20 sequenced, by selectively amplifying the region of the eae 

gene using the polymerase chain reaction method and 
genomic DNA. It is possible to design synthetic 
oligonucleotide primers from the sequence shown in the 
Sequence Listing as SEQ ID NOsl for use in PGR and for 

25 screening genomic libraries. An amplified fragment can be 

cloned and characterized by DNA sequence analysis. DNA 
segments of the present invention encoding the protein 
associated with attaching and effacing activity of EHEC 
may also be constructed by chemical synthesis and 

30 enzymatic ligation reactions using procedures known in the 

ar,t. 

It will be appreciated that the Invention includes 
nucleotide or amino acid sequences which have substantial 
35 sequence homology with the nucleotide and amino acid 

sequences shown in the Sequence Listing as SEQ ID NOsl. 
The term "sequences having substantial sequence homology" 
means those nucleotide and amino acid sequences which have 

- 12 - 
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slight or inconsequential sequence variations from the 
sequences disclosed in the Sequence Listing as SEQ ID N0:1 
i.e. the homologous sequences function in substantially 
the same manner to produce substantially the same 
polypeptides as the actual sequences. The variations may 
be attributable to local mutations or structural 
modifications . 

It will also be appreciated that a double stranded 
nucleotide sequence comprising a DNA segment of the 
invention as shown in Figure 1 or an oligonucleotide 
fragment thereof, hydrogen bonded to a complementary 
nucleotide base sequence (see Sequence Listing SEQ ID 
N0:1), an RNA made by transcription of this doubled 
stranded nucleotide sequence, and an antisense strand of 
a DNA segment of the invention or an oligonucleotide 
fragment of the DNA segment, are contemplated within the 
scope of the invention. 

A number of unique restriction sequences for restriction 
enzymes are incorporated in the DNA segment identified in 
the Sequence Listing as SEQ ID NOsl, and these provide 
access to nucleotide sequences which code for polypeptides 
unique to the protein associated with attaching and 
effacing activity of EHEC. DNA sequences unique to the 
protein associated with attaching and effacing activity of 
EHEC or isoforms thereof, can also be constructed by 
chemical synthesis and enzymatic ligation reactions 
carried out by procedures known in the art. 

The DNA segments of the invention or oligonucleotide 
fragments of the DNA segments, allow those skilled in the 
art to construct nucleotide probes for use in the 
detection of nucleotide sequences in samples such as 
biological specimens or food samples. A nucleotide probe 
may be labelled with a detectable marker such as a 
radioactive label v/hich provides for an adequate signal 
and has sufficient half-life such as ^^P, ^H, or the 
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like. Other labels which may be used include antigens 
that are recognized by a specific labelled antibody, 
fluorescent compounds, enzymes, antibodies specific for a 
labelled antigen, and chemiluminescent compounds. An 
5 appropriate label may be selected having regard to the 

rate of hybridization and binding of the probe to the 
nucleotide to be detected and the amount of nucleotide 
available for hybridization. 

10 The nucleotide probes may be used to detect DMA segments 

having sequences that encode proteins related to or 
analogous to the protein associated with attaching and 
effacing activity of EHEC. 

15 Accordingly the present invention also relates to a method 

for detecting the presence of a DNA segment having a 
sequence encoding a protein related to or analogous to a 
protein associated with attaching and effacing activity of 
enterohemorrhagic E.coli or an oligonucleotide fragment 

20 thereof in a sample, comprising contacting the sample with 

a nucleotide probe capable of hybridizing with the DNA 
segment or an oligonucleotide fragment thereof to form a 
hybridization product, under conditions which permit the 
formation of the hybridization product, and assaying for 

25 the hybridization product. The nucleotide probe may be 

labelled with a detectable marker as described herein and 
the hybridization product may be assayed by detecting the 
detectable marker or the detectable change produced by the 
detectable marker. 

30 

The invention also provides a kit for detecting the 
presence of a DNA segment having a sequence encoding a 
protein related to or analogous to a protein associated 
with attaching and effacing activity of enterohemorrhagic 
35 E.coli or an oligonucleotide fragment thereof in a sample, 

comprising a nucleotide probe capable of hybridizing with 
the DNA segment or an oligonucleotide fragment thereof, 
reagents required for hybridization of the nucleotide 

" 14 - 
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probe with the DNA segment or an oligonucleotide fragment 
thereof, and directions for its use. 

The DNA segment of the invention permits the 
identification and isolation, or synthesis of nucleotide 
sequences which may be used as primers to amplify a DNA 
segment of the invention or an oligonucleotide sequence 
thereof, for example in the polymerase chain reaction 
(PGR) . The length and bases of the primers for use in the 
PGR are selected so that they will hybridize to different 
strands of the desired sequence and at relative positions 
along the sequence such that an extension product 
synthesized from one primer when it is separated from its 
template can serve as a template for extension of the 
other primer into a nucleic acid of defined length. 

The primers may be prepared using techniques known in the 
art such as for example phosphotriester and phosphodiester 
methods or automated techniques. Restriction endonuclease 
digests may also be used as primers. 

It will be appreciated that the primers may contain non- 
complementary sequences provided that a sufficient amount 
of the primer contains a sequence which is complementary 
to the DNA segment of the invention or oligonucleotide 
sequence thereof, which is to be amplified. Restriction 
site linkers may also be incorporated into the primers 
allowing for digestion of the amplified products with the 
appropriate restriction enzymes facilitating cloning and 
sequencing of the amplified product. 

It will also be appreciated that the invention includes 
nucleotide sequences which have substantial sequence 
homology with the above-mentioned nucleotide probes and 
primers. Further, it will be appreciated that a double 
stranded nucleotide sequence comprising a nucleotide probe 
or primer, hydrogen bonded to a complementary nucleotide 
base sequence, an RNA made by transcription of this 
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doubled stranded nucleotide sequence, and an antisense 
strand of a nucleotide probe or primer, are contemplated 
within the scope of the invention. 

In an embodiment of the invention a method of determining 
the presence of a DNA segment having a sequence encoding 
a protein associated with attaching and effacing activity 
of enterohemorrhagic E.coli or a predetermined 
oligonucleotide fragment thereof in a sample, is provided 
comprising treating the seimple with primers which are 
capable of amplifying the DNA segment or the predetermined 
oligonucleotide fragment thereof in a polymerase chain 
reaction to form amplified sequences, under conditions 
which permit the formation of amplified sequences and, 
assaying for amplified sequences. 

The polymerase chain reaction refers to a process for 
amplifying a target nucleic acid sequence as generally 
described in Innis et al. Academic Press, 1990 and U.S. 
Patent 4,800,159. Generally, amplification of the target 
nucleic acid sequence may be accomplished by means of a 
pair of primers which flank the nucleic acid sequence to 
be amplified. The primers hybridize to opposite strands 
of the target sequence and DNA synthesis proceeds across 
the region between the primers, thereby doubling the 
amount of that DNA segment. Repeated cycles of 
denaturation, priming and extension permit rapid 
exponential amplification of the target sequence. 

The amplified products can be isolated and distinguished 
based on their respective sizes using techniques known in 
the art. For example, after amplification, the DNA sample 
can be separated on an agarose gel and visualized, after 
staining with ethidium bromide, under ultra violet (UV) 
light. DNA may be amplified to a desired level and a 
further extension reaction may be performed to incorporate 
nucleotide derivatives having detectable markers such as 
radioactive labelled or biotin labelled nucleoside 
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triphosphates. The detectable markers may be analyzed by 
restriction and electrophoretic separation or other 
techniques known in the art. 

5 The conditions which may be employed in the methods of the 

invention using PGR are those which permit hybridization 
and amplification reactions to proceed in the presence of 
DNA in a sample and appropriate complementary 
hybridization primers. Conditions suitable for the 

10 polymerase chain reaction are generally known in the art. 

Preferably, the PGR utilizes Tag polymerase (GeneAmp Kit, 
Perkin Elmer Cetus) as the polymerization agent and each 
cycle consists of the following! denaturation at 94°C X 1 
min; annealing at 55°C X 1 min; and extension at 72°C X 1 

15 min. 

The invention still further provides a kit for determining 
the presence of a DNA segment encoding a protein 
associated with attaching and effacing activity of 

20 enterohemorrhagic E.coli or a predetermined 

oligonucleotide fragment thereof in a sample comprising 
primers which are capable of amplifying the DNA segment or 
the predetermined oligonucleotide fragment thereof, in a 
polymerase chain reaction to form amplified sequences, 

25 means for detecting the amplified sequences, and 

directions for its use. 

The present invention also relates to a method of 
detecting verotoxin-producing E.coli using the DNA segment 

30 of the invention and oligonucleotide fragments thereof. As 

hereinbefore mentioned the present inventors have found 
that a Sall-StuI fragment (C1-C2) comprising the sequences 
as shown in the Sequence Listing as SEQ ID N0s2 and SEQ ID 
NO: 3, may be used to detect verotoxin-producing E.coli of 

35 the serogroups 05, 026, 0103, 0111, 0118, 0145, and 0157. 

A 0.5 kb fragment (A3-B2) comprising the sequences as 
shown in the Sequence Listing as SEQ ID N0I4 and SEQ ID 
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N0s5 has also been found to be useful for detecting 
verotoxin-producing E.coll serogroup 0157. 

The present inventors have also designed specific 
5 nucleotide probes for detecting pathogenic verotoxin- 

producing E.coli strains of the serogroup 0157. In 
particular, the synthetic nucleotide probes PIEH and P2EH 
shown in the Sequence Listing as SEQ ID NO: 6 and SEQ ID 
N0s7, respectively, were found to specifically detect only 
10 verotoxin-producing E.coli serotypes 0157 sH7 and 0157 sNM. 

Non verotoxin-producing 0157:NM strains and other 0157 sH 
serotypes were not detected by PIEH and P2EH, making these 
probes ideal for detecting pathogenic 0157 BHEC. 

15 The synthetic nucleotide probes PIEH and P2EH shown in the 

Sequence Listing as SEQ ID NO: 6 and SEQ ID NO: 7, 
respectively, were also used in PGR and were found to 
specifically detect only verotoxin-producing E.coli 0157 
serogroups. Non verotoxin-producing 0157 serogroups were 

20 not detected by PIEH and P2EH, making these probes ideal 

for detecting pathogenic 0157 EHEC. 

The nucleotide sequences PIO and P20 shown in the Sequence 
Listing as SEQ ID NO: 8 and SEQ ID NO: 9, respectively, 
25 based on the 0111 sH8 sequence were found to detect 

verotoxin-producing 0111:H8 and 0111:NM but not 0111:H11 
strains indicating that there is further diversity in the 
carboxyl -terminal within the serogroup. 

30 Accordingly, the present invention provides a purified and 

isolated nucleotide sequence comprising the sequence shown 
in the Sequence Listing as SEQ ID NO: 2; the sequence shown 
in the Sequence Listing as SEQ ID NO: 3; the sequence shown 
in the Sequence Listing as SEQ ID NO: 4, the sequence shown 

35 in the Sequence Listing as SEQ ID NO: 5; the sequence shown 

in the Sequence Listing as SEQ ID NO: 6, the sequence shown 
in the Sequence Listing as SEQ ID NO: 7; the sequence shown 
in the Sequence Listing as SEQ ID NO: 8; the sequence shown 
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in the Sequence Listing as SEQ ID NO: 9; a sequence having 
substantial homology thereto or a fragment of the 
nucleotide sequence. The location of the sequences of the 
sequence shown in the Sequence Listing as SEQ ID N0s2, SEQ 
ID N0;3, SEQ ID N0:4, SEQ ID N0s5, SEQ ID NOsS and SEQ ID 
N0s7 on the 0157 sH7 eae gene are shown in Figure 6. 

The invention relates to a method for detecting verotoxin- 
producing E.coli of the serogroups 05, 026, 0103, 0111, 
0118, 0145, and 0157 in a sample, comprising contacting 
the sample with a Sall-StuI fragment (C1-C2) comprising 
the sequence as shown in the Sequence Listing as SEQ ID 
NO: 2 and the sequence as shown in the Sequence Listing as 
SEQ ID NO! 3, under conditions which permit the fragment to 
hybridize with a complementary sequence in the sample to 
form a hybridization product, and assaying for the 
hybridization product. 

The invention relates to a method for detecting verotoxin- 
producing E.coli serogroup 0157 in a sample comprising 
contacting the sample with a 0.5 kb fragment (A3-B2) 
comprising the sequence as shown in the Sequence Listing 
as SEQ ID NO 8 4 and the sequence as shown in the Sequence 
Listing as SEQ ID NOj5, under conditions which permit the 
fragment to hybridize with a complementary sequence in the 
sample to form a hybridization product, and assaying for 
the hybridization product. 

Hybridization conditions which may be used in the method 
of the invention are known in the art and are described 
for example in Sambrook J, Fritch EF, Maniatis T. In: 
Molecular Cloning, A Laboratory Manual, 1989. (Nolan C, 
Ed.), Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, NY. The hybridization product may be assayed using 
techniques known in the art. The oligonucleotide fragments 
may be labelled with a detectable marker as described 
herein and the hybridization product may be assayed by 
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detecting the detectable marker or the detectable change 
produced by the detectable marker. 

The invention still further relates to a method for 
detecting pathogenic verotoxin-producing E.coli strains of 
the serogroup 0157 in a sample comprising treating the 
sample with the nucleotide sequence PIEH shown in sequence 
the Sequence Listing as SEQ ID NO; 6 and the nucleotide 
sequence P2EH5 shown in the Sequence Listing as SEQ ID 
NO: 7 in a PGR reaction, under conditions which permit the 
formation of amplified sequences which terminate with the 
nucleic acid sequence of one of the primers and the 
complementary nucleic acid sequence of the other of the 
primers, and, assaying for the amplified sequences. 

The PGR reaction and suitable conditions permitting 
formation of amplified sequences are described above. 

A kit for detecting verotoxin-producing E.coli of the 
serogroups 05, 026, 0103, 0111, 0118, 0145, and 0157 in a 
sample is provided comprising a Sall-StuI fragment (G1-C2) 
comprising the sequence as shown in the Sequence Listing 
as SEQ ID N0S2 and the sequence as shown in the Sequence 
Listing as SEQ ID N0t3, reagents required for 
hybridization of the fragment with a DNA segment encoding 
the attaching and effacing protein of EHEC or an 
oligonucleotide fragment thereof, and directions for its 
use. 

A kit for detecting verotoxin-producing E . coli of the 
serogroup 0157 in a sample is also provided comprising a 
0.5 kb fragment {A3-B2) comprising the sequence as shown 
in the Sequence Listing as SEQ ID N0s4 and the sequence as 
shown in the Sequence Listing as SEQ ID NO: 5, reagents 
required for hybridization of the fragment with a DNA 
segment encoding the attaching and effacing protein of 
EHEC or an oligonucleotide fragment thereof, and 
directions for its use. 
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The invention also contemplates a kit for detecting 
pathogenic verotoxin-producing E.coli strains of the 
serogroup 0157 in a sample comprising the primers PIEH and 
P2EH as shown in the Sequence Listing as SEQ ID NOsS and 
5 as SEQ ID NO: 7, respectively, reagents required for the 

amplification in a PGR reaction of sequences which 
terminate with the nucleic acid sequence of one of the 
primers and the complementary nucleic acid sequence of the 
other of the primers, and directions for its use. 

10 

The DNA segment of the present invention having a sequence 
which codes for a protein associated with attaching and 
effacing activity of EHEC, or an oligonucleotide fragment 
of the DNA segment including the nucleotide sequences of 

15 probes and primers described herein may be incorporated in 

a known manner into a recombinant molecule which ensures 
good expression of the protein or part thereof. In 
general, a recombinant molecule of the invention contains 
the DNA segment or an oligonucleotide fragment thereof of 

20 the invention and an expression control sequence 

operatively linked to the DNA segment or oligonucleotide 
fragment. The DNA segment of the invention or an 
oligonucleotide fragment thereof, may be incorporated into 
a plasmid vector, for example, pTZlSR. 

25 

The protein associated with attaching and effacing 
activity of EHEC or isoforms or parts thereof, may be 
obtained by expression in a suitable host cell using 
techniques known in the art. Suitable host cells include 

30 prokaryotic or eukaryotic organisms or cell lines, for 

example, E.coli JM 101 and E.coli LE 392. The protein or 
parts thereof may be prepared by chemical synthesis using 
techniques well known in the chemistry of proteins such as 
solid phase synthesis (Merrifield, 1964, J. Am. Chem. 

35 Assoc. 85! 2149-2154) or synthesis in homogenous solution 

(Houbenweyl, 1987, Methods of Organic Chemistjry, ed. E. 
Wansch, Vol. 15 I and II, Thieme, Stuttgart). 
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The protein associated with attaching and effacing 
activity of EHEC or isoforms or parts thereof, of the 
invention may be expressed in an avirulent bacterial 
carrier strain such as Salmonella and Shigella strains. 
Accordingly, in a further aspect of the invention an 
avirulent strain of an EHEC is provided comprising an 
avirulent bacterial carrier strain transformed with a 
recombinant molecule of the invention. The avirulent 
strain may provide the basis for a vaccine composition 
which may be useful for effecting immunity against 
diseases caused by EHEC, in particular B.coli serotype 
0157 :H7. The invention therefore also provides a vaccine 
composition comprising a bacterial carrier strain 
transformed with a recombinant molecule of the invention. 
The vaccine composition may be useful in effecting 
immunity against diseases such as hemorrhagic colitis and 
hemolytic uremic syndrome. 

The vaccine compositions can be prepared by per se known 
methods for the preparation of pharmaceutically acceptable 
vaccines which can be administered to patients . The 
vaccine composition may be in an oral or injectable form 
and may include pharmaceutically acceptable vehicles . On 
this basis, the vaccine compositions include, albeit not 
exclusively, solutions of the bacterial carrier strain 
transformed with a recombinant molecule of the invention 
in association with one or more pharmaceutically 
acceptable vehicles or diluents, and contained in buffered 
solutions with a suitable pH and iso-osmotic with the 
physiological fluids. 

The protein associated with attaching and effacing 
activity of EHEC or parts thereof, may be used to prepare 
monoclonal or polyclonal antibodies . Antibodies may be 
prepared which bind a distinct epitope in an unconserved 
region of the protein. Conventional methods can be used to 
prepare the antibodies. As to the details relating to the 
preparation of monoclonal antibodies reference can be made 
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to Coding, J.W., Monoclonal Antibodies s Principles and 
Practice, 2nd Ed., Academic Press, London, 1986. In 
general monoclonal antibodies are prepared by obtaining 
hybridomas derived from immortalizing antibody producing 
5 cells obtained from a mammal having been immunized with a 

protein, and screening the hybridomas for production of 
antibody which binds the isolated protein. 

The polyclonal or monoclonal antibodies may be used to 
10 detect a protein associated with attaching and effacing 

activity of EHEC in various biological materials or food 
samples, for example they may be used in an Elisa, 
radioimmunoassay or histochemical tests. a?hus, the 
antibodies may be used as diagnostic reagents to quantify 
15 the amount of protein associated with attaching and 

effacing activity in a sample. 

The monoclonal and polyclonal antibodies may be labelled 
with a detectable marker including various enzymes, 

20 fluorescent materials, luminescent materials and 

radioactive materials . Examples of suitable enzymes 
include horseradish peroxidase, biotin, alkaline 
phosphatase, p-galactosidase, or acetylcholinesterase; 
examples of suitable fluorescent materials include 

25 umbelliferone, fluorescein, fluorescein isothiocyanate, 

rhodamine, dichlorotriazinylamine fluorescein, dansyl 
chloride or phycoerythrin; an example of a luminescent 
material includes luminol; and examples of suitable 
radioactive material include l''^^, l"^ or tritium. 

30 

The invention will be more fully understood by reference 
to the following examples. However, the examples are 
merely intended to illustrate embodiments of the invention 
and are not to be construed to limit the scope of the 
35 invention. 

EXAMPLES 
Example 1 
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Cloning and sequencing of eae gene homologue from EHEC 
strain 0157 

The following materials and methods were used in the 
investigations illustrated in the examples 

Bacteriological media 

E.coll strains were grown in L~broth and L-agar (Lech, K. 
and Brent, R. (1989) in Current Protocols in Molecular 
Biology (Ausubel, F.M. et al. Eds.) John Wiley & Sons, New 
York) supplemented when necessary with lOOug/ml 
carbenicillin (Sigma Chemical Co., St. Louis, MO). For 
production of single stranded DNA, strains were grown in 
2xYT broth (Sajmbrook, J., Fritch, E.F., and Maniatis, T. 
(1989) in Molecular Cloning, A Laboratory Manual (Nolan, 
C, Ed.) Cold Spring Harbor Laboratory Press, New York). 
Bacteriophage lambda was grown in lambda broth (Lech, K. 
and Brent, R. (1989) in Current Protocols in Molecular 
Biology (Ausubel, F.M. et al. Eds.) John Wiley & Sons, New 
York) , 

Bacterial strains and vectors 

E.coli strains E2348/69 (EPEC) , serotype 01275H6 and CL8 
(EHEC), serotype 0157 :H7 were obtained from 
Dr. M. Karmali, The Hospital for Sick Children, Toronto, 
Canada, and have been previously described (Levine, M.M. , 
Nataro, J. P., Karch, H. , Baldini, M.M., Kaper, J.B. , 
Black, R.E., Clements, M.L, and O'Brien, A.D. (1985) J. 
Infect. Dis. 152, 550-559; Karmali, M.A. , Petric, M., Lim, 
C, Fleming, P.C., Arbus, G.S. and Lior, H. (1985) J. 
Infect. Dis. 151, 775-882; Sherman, P., Soni, R. and 
Karmali, M.A. (1988) Infect. Immun. 56, 756-761). E.coli 
JMlOl (Messing, J. (1979) Recomb. DNA Tech. Bull. 2,43-48) 
was used as the host for plasmid pTZ18R (Mead, D.A. , 
Szczesna-Skorupa, E,, and Kemper, B. (1986) Protein 
Engineering. 1,67-74) and E.,&oli LE392 (Borck, K., Beggs, 
J.D., Brammar, W.J., Hopkins, A.S., and Murray, N.E. 
(1976) Mol. Gen. Genet. 146, 199-207; Murray, 
N.E. , Brammar, W.J., and Murray, K. (1977) Mol. Gen. Genet. 
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150,53-61) was used for propagation of bacteriophage 
lambda GEM-11 (Promega, Madison, WI). Bacteriophage M13K07 
(Rokeach, L. , Haselby, j. and Ahoch, S. (1988) Proc. Nat. 
Acad. Sci. USA. 85, 4832-4836) was used as a helper to 
produce single stranded DNA from pTZlSR and recombinant 
derivatives . 



DNA manipulation and sequencing 

Plasmid DNA was prepared by the method of Birnboim and 
Doly (Birnboim, H.C. and Doly, j. (1979) Nucl. Acids Res. 
7, 1513-1523). Bacterial genomic DNA was extracted using 
hexadecyltrimethyl ammonium bromide precipitation (Murray, 
M.G., and Thompson, W.F. (1980) Nucl. Acids Res. 8,4321- 
4325). Genomic DNA isolated from the EHEC strain CL8 was 
partially digested with Sau3A and fragments were size- 
fractionated by sucrose density gradient centrifugation 
(Sambrook, j., Pritch, E.F., and Maniatis, T. (1989) in 
Molecular Cloning, A Laboratory Manual (Nolan, C, Ed.) 
Cold Spring Harbor Laboratory Press, New York). The 
desired fragments (14-23kb) were isolated and ligated to 
dephosphorylated lambda Gem- 11 BainHlarms (Promega, 
Wisconsin, MA] and packaged using the Packagene System 
extracts (Promega, Wisconsin, MA) . Bacteriophage particles 
were propagated in E.coli LE392 and plated for plaque 
isolation. 



The 1 Kb Stul-Sall fragment described by Jerse et al 
(Jerse, A., Yu, J., Tall, B., and Kaper, J. (1990) Proc. 
Natl. Acad. Sci. USA 87, 7839-7843) was used as a probe 
for the eae gene. It was produced by amplification of the 
central region of the eae gene of strain E2348/69 by the 
polymerase chain reaction (Sambrook, J., Fritch, E.F., and 
Maniatis, T. (1989) in Molecular Cloning, A Laboratory 
Manual (Nolan, c, Ed.) Cold Spring Harbor Laboratory 
Press, New York) using Tag polymerase (Perkin Cetus Elmer 
Corp., Norwalk, CT) . Primers flanking the Stul and Sail 
sites were designed using the nucleotide sequence of the 
eae gene of E2348/69 deposited in Genbank by Jerse et al . 
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(Jerse, A., Yu, J., Tall, B., and Kaper, J. (1990) Proc . 
Natl. Acad. Sci. USA 87, 7839-7843). The upstream primer 
was ATGGAATTCTCGTCACAGTTGCAGGCCTGGT (2241-2253 of the eae 
sequence) and the downstream primer was 
ATGGAATTCCGAAGTCTTATCA- GCCGTAAAGT (3350-3328 of the eae 
sequence) with EcoRl recognition sequences included at the 
5 'end of each primer. The fragment was labelled with a'^P 
dATP using a random priming kit (Boehringer Mannheim) .This 
probe was used in plaque hybridizations to further 
identify clones containing the putative eae sequence. 

Fragments which hybridized with the probe were subcloned 
in pTZlBR (Sambrook J, Fritch EF, Maniatis T. In: 
Molecular cloning, A Laboratory Manual, 1989. (Nolan C, 
Ed.), Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, NY.), Nested deletions were constructed by 
sequential digestion with exonuclease III and SI nuclease 
using the Erase-a-base System (Promega, Madison, WI) . 
Single stranded templates were prepared from fragments 
cloned in pTSlSR by superinfection with bacteriophage 
M13K07. DNA sequencing was performed using the Sanger 
dideoxy chain termination method (Sanger, F., Niklen, S, 
and Coulson, A. R. (1977) Proc .Nat .Acad. Sci. USA. 74, 5463- 
5468) with Sequenase Version 2.0 (United States 
Biochemicals , Cleveland, Ohio). The entire sequence of a 
3,5Kb region including the eae gene was determined using 
oligonucleotide primers where necessary. Sequence analysis 
was performed using the Wisconsin Genetics Computer Group 
sequence analysis software package version 6.0 (Devereux, 
J., Haeberli, P., and Smithies, 0. (1984) Nucleic Acids 
Res. 12, 387-395). 

Nucleotide sequence and accession number 

The DNA sequence data was submitted to the EMBL database 
and assigned the accession number X60439. 

Of the plaques screened, several hybridized to the EPEC 
eae probe on primary screening and one designated A.GEM-11 
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D4-4 with an insert of 14Kb, was chosen for furtg^'J/^^lc^g 
The fragments carrying the EHEC eae homologue were 
identified by hybridization to the 1Kb EPEC probe. The 
entire predicted structural gene was sequenced including 

5 205 bp upstream and 118 bp downstream. Figure 1 shows the 

DNA sequence of the EHEC eae gene and the deduced protein 
sequence starting at the start codon at 206 bp. The end of 
the open reading frame is denoted by an asterisk. The 
sequence was compared with that of the EPEC eae gene. 

10 The EPEC and EHEC sequences are virtually identical for 

the first 2200 bp of the structural gene and for 
approximately 200 bp upstream of the start site. However, 
there is considerable divergence in the last 800 bp where 
the similarity is only 59% (Figure 2). 

15 

The amino acid sequence alignment of the predicted EPEC 
and EHEC proteins is shown in Figure 3 . The positions of 
the terminal cysteine residues are denoted by asterisks. 
The N-terminal amino acids are highly conserved for the 
20 first 685 amino acids while for the C- terminal amino 

acids, only short stretches can be aligned. There is 
conservation of cysteine residues at positions 859 and 933 
of the alignment. 

25 Jerse et al. first noted the similarity between the 

central regions of eae gene of EPEC and the inv gene of 
Yersinia pseudotuberculosis (Jerse AE, Yu J, Tall BD, 
Kaper, J.B. Proc Natl Acad Sci USA 1990; 87s7839-43). The 
invasin protein, a product of the inv gene is an outer 

30 membrane protein of Y. pseudotuberculosis which has been 

extensively studied by Isberg et al. (Isberg, R.R. , 
Voorhis, D.L, and Palkow, S. (1987) Cell. 50,769-778). 
When expressed in E.coli K12, the inv gene is necessary 
and sufficient for invasion of HEp-2 cells in tissue 

35 culture (Isberg, R.R., and Falkow, S. (1985) Nature. 317, 

262-264). The invasin protein has been divided into three 
domains on the basis of mutational analysis and the study 
of fusion proteins (Isberg, R.R. (1989) Mol. Microbiol. 
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3,1449-1453). The amino terminus is important for export 
beyond the inner membrane while the central region is 
critical for stable localization in the outer membrane. 
The last 192 residues at the carboxyl terminus have been 
shown to be responsible for binding to j31 integrin 
receptors on the eukaryotic cell surface (Leong, J.M. , 
Pournier, S., and Isberg, R.R. (1990) EMBO. Jnl . 9,1979- 
1989). a?his domain also carries the epitopes recognized by 
neutralizing monoclonal antibodies . Examination of the two 
sequences published for the inv genes of Y. enterocolitica 
(Miller, V.L. and Falkow, S. (1988) Infect. Immun. 
56,1242-1248) and the Y. pseudotuberculosis 
(Isberg RR, Voorhis DL, and Falkow S. Cell 1987; 50s769- 
78) showed that there was similarity to the eae genes in 
the central one third of the sequence. The predicted 
protein products of the two inv genes also possess a 
subterminal cysteine residue with a second approximately 
80 amino acids upstream. This suggests that the carboxyl 
termini of all four proteins contain a disulfide loop 
which may be necessary for biologic activity. 

On the basis of the similarities between the inv and eae 
genes, the carboxyl termini of the eae proteins are 
predicted to be the receptor binding domains. It is likely 
that the dissimilarities in this region of the molecules 
are responsible for antigenic variation and possibly for 
differences in receptor binding specificity. Sherman et al 
(Sherman, P., Cockerill III, F., Soni, R. , and Brunton, J. 
(1991) Infect. Immun. 59,890-899) showed that antiserum 
raised against a 94 kilodalton outer membrane protein of 
an 0157 :H7 strain inhibited adherence and AE lesion 
formation by the homologous EHEC strain but had no effect 
on the adherence of the EPEC strain E2348/69. Differences 
in receptor specificity could explain the differences 
observed in intestinal colonization by EPEC and EHEC 
strains. SPEC colonize the entire intestine in animal 
models and the upper small bowel of humans . In 
contrast , EHEC strains colonize the cecum and large 
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intestine in animal models while in humans they cause 
intense proximal colonic inflammation, characteristic of 
hemorrhagic colitis (Knutton, S., Baldwin, T., Williams, 
P,H., and McNeish, A. S. (1989) Infect. Immun. 57, 1290- 

5 1298; Tzipori, S., Wachsmuth, I., Smithers, J., and 

Jackson, C, (1988) Gastroenterology 94, 590-597; 
Tzipori, S., Wachsmuth, I.K., Chapman, C, Birner, R., 
Brittingham, J., Jackson, C, and Hogg, J. (1986) J. 
Infect. Dis. 154, 712-716; Riley, L. (1987) Annu. Rev. 

10 Microbiol. 41, 383-407; Karmali, M. (1989) Clin. 

Microbiol. Rev. 2, 15-38). Recent reports suggest that 
adhesions encoded by the high molecular weight plasmids of 
EPEC and EHEC strains are important in determining the 
specificity of adherence to tissue culture cells 

15 (Baldini, M.M., Kaper, J.B., Levine, M.M. , Candy, D.C.A., 

and Moon, H.W. (1983) J. Ped. Gastroenterol. Nutr. 2,534- 
538; Karch, H. , Hessemann, J., Laufs, R. , O'Brien, A.D., 
Tacket, CO., and Levine, M.M. (1987) Infect. Immun. 
55,455-461; Cantey, J.R., and Moseley, S.L, (1991) Infect. 

20 Immun. 59, 3924-3929; Jerse, A.E., Gicquelais, K.G. , and 

Kaper, J.B. (1991) Infect . Immun . 59, 3869-3875). It 
remains to be seen whether differences in the C-terminal 
regions of the eae gene products of different EPEC and 
EHEC strains might result in alterations of receptor 



EXAMPLE 2 

Serotype Distribution of the EHEC eae Gene Among 
Verotoxin-Producing E.coli . 

30 

Probes based on the EHEC eae gene were used to study the 
serotype distribution of the eae homologue in a collection 
of verotoxin-producing E.coli isolated from human animal 
and food sources using colony hybridization. Colony 
35 hybridizations were carried out using standard techniques 

under highly stringent conditions (Sambrook J, Fritch EF, 
Maniatis T. In: Molecular cloning, A laboratory 
Manual, 1989. (Nolan C, Ed.), Cold Spring Harbor Laboratory 
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Press, Cold Spring Harbor, NY.). DNA for probes was made 
by PCR amplification using strain 0157 sH? or strain 
0111 :H8. The primers used to amplify the probes are set 
forth in Table 1 and are also shown in the Sequence 
Listing as SEQ ID NOS: 2 to 9 . DNA fragments used as 
probes were labeled with [a^^P] dATP using the Random 
Primed DNA Labeling Kit (Boehringer Mannheim) . 
Oligonucleotide probes were end-labeled with [y^^P] dATP 
using T4 polynucleotide kinase (Sambrook J, Fritch EF, 
Maniatis T. Ins Molecular cloning, A laboratory 
Manual, 1989. (Nolan C, Ed.), Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, NY.). 

DNA for sequencing was made by PCR amplification using 
primers based on sequences within the 0157 sH? eae gene. 
Sequencing was carried out using the ds DNA Cycle 
Sequencing System (BRL Life Technologies) according to the 
manufacturer ' s instructions . 

The results of the colony blot hybridizations using probes 
based on the 0157 sH? EHEC eae gene are shown in Table 2 
and Figure 4. The results show that whereas the central 
1 kb fragment C1-C2 hybridized to a variety of the O 
serogroups of VTEC tested, a 0.5 kb fragment A3-B2 in the 
carboxyl -terminal region only hybridized to 0157 strains. 
Eae homologues among VTEC were detected in serogroups 
0157, 0111, 026, 05, O103, 0145 and 0118 but not 0113 and 
0156. A 0.5 kb fragment (A3-B2) derived from the less 
conserved 3' region hybridized with serogroups 0157 and 
occasional 0111 strains. However, a more specific 0.45 kb 
probe (P1EH-P2EH) only hybridized with VTEC of serogroup 
0157 (Figure 5) . 

Example 3 

Oligonucleotide primers specific to the EHEC eae gene were 
used in polymerase chain reaction assays to test 150 VTEC 
of diverse serogroups which were isolated from food, 
animal and human specimens. In particular, previously 
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characterized and serotyped isolates of verotoxin- 
producing E .coli and enteropathogenic E.coli strains 
derived from human, food and animal sources were 
contributed by A.Borczyk, C. Gyles and M.Karmali, The 

5 Hospital for Sick Children, Toronto, Canada. Several 

colonies were selected, resuspended in 50nl PCR buffer 
(XI) (Perkin Elmer Cetus) and boiled for 10 mins prior to 
use as template DNA in PGR amplifications. The PCR 
reaction conditions were as follows: 50|il reactions were 

10 set up using ImM deoxy nucleotide triphosphates, IX PCR 

Buffer, 1 unit Tag polymerase (GeneAmp Kit, Perkin Elmer 
Cetus), 20 pmol of each primer and lul sample preparation. 
Each cycle (X40) consisted of the following: 
denaturation @ 94°C X 1 min 

15 annealing 8 55°C X 1 min 

extension @ 72°C X 1 min. 

Primers based on the 0157 :H7 sequence and the 0111 
sequence, (primers PIO and P20) were used to amplify DNA 
20 from the various serogroups. The primers are shown in 

Table 1. The location of the A3, B2, PIEH, and P2EH 
primers on the 0157 :h7 eae gene are shown in Figure 6. 

Colony blot hybridization using [y^^P] dATP oligonucleotide 
25 probes was used to confirm the specificity of the primers. 

Colony blot hybridizations were performed as described in 
Example 2 . 

Verotoxin-producing E.coli of serogroups 0157 and 0111 
30 strains were detected by PCR with primers A3-B2, giving a 

0.5 kb fragment. Both primer sets A3-B2 and P1EH-P2EH did 
not detect non verotoxin-producing E .coli of the 
serogroups 0157 and 0111. Primer set P1EH-P2EH 
specifically detected only 0157 verotoxin positive strains 
35 giving a 0.45 kb fragment (Table 3, Figure 7). Figure 7 

shows the amplification products of VTEC and EPEC using 
specific primer sets A3-B2 (0.5 kb) and PIEH-P2EH (0.45kb) 
and in particular the following: 
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LANES 1,20:MW STANDARDS 

LANE 3s0157:H7+ 

LANE 5:0111:H8+ 

LANE 7:0111:NM+ 

A3-B2 PRIMERS 

LANE SsOSsNM - 

LANE 9s0127:H6~ 

LANE 13:025sHll- 

LANES 10,llsNEG CONTROLS 

LANE 14S0157SH7+ 

LANE 15:0111iH8- 

P1EH-P2EH PRIMERS 

LANE ISsOlllsNM- 

LANE 19:NEG CONTROL 

Colony blot hybridization using [y^^P] dATP oligonucleotide 
probes confirmed the specificity of primers P1EH-P2EH data 
obtained from the PGR. Only verotoxin-producing 0157 
serogroups hybridized with PIEH and P2EH (Figure 8). 

Attempts to amplify DNA from serogroups 05, 025 and 0118 
using primers based on the 0157 sH7 sequence were 
unsuccessful. The only serogroup which gave a PCR 
amplification product was an 0111 sH8 strain using primers 
2214 and 3011. Comparisons of the C-terminal ends of the 
eae genes from an 0157 sH7, 0111 sH8, an EPEC strain and the 
1. pg^vdQtwberculosig las gene are shown in Figure 9 . 

Based on the 0111 sequence, primers PIO and P20 were 
designed which gave a PCR product for 0111 sNM and 0111 sHB 
verotoxin-producing {VT+) strains but not 0111 sHll 
indicating that there are differences among (VT+) 0111 
serotypes. However, non verotoxin-producing (VT-) 0111 sH2 
and 0111 :H12 strains were negative. 

The present invention has been described in detail and 
with particular reference to the preferred embodiments; 
however, it will be understood by one having ordinary 
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skill in the art that changes can be made thereto without 
departing from the spirit and scope thereof. 

Foirming part of the present disclosure are the appended 
5 sequence listings. 
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PRIMERS 

CI 5'ATG GAA TTC TCG TCA CAG a?TG ChG 6CC TGG T 3 ' sense 

C2 5'ATG GAA TTC C6A AGT CTT ATC A6C CGT AAA GT 3'antisense 

A3 5'ATG GAA TTC 6GA TGT TCA ACG GTA AGT CT 3' sense 

B2 5'ATG GAA TTC ACA ATA CCG TCT GTG TGG AT 3' antisense 

2214 5'ATG G6A TCC CCG ATA TCG CAG TAG CAA ATG GTA AGG AT 3 ' sense 

3011 5'ATG AAG CTT GAA TTC TCT ACA CAA ACC GCA T 3' antisense 

PIEH 5 'AAG CGA CTG AGG TCA CT 3' sense 

P2EH 5 'ACG CTG CTC ACT AGA TGT 3 ' antisense 

PIO* 5 'AGA CCT AGG TTA CAT TT.. 3' sense 
P20* 5 'TAT TTT ATC AGC TTC AGT 3 ' antisense 



~ 34 - 



TABLE 2 



2078710 



SEROGROUP DISTHIBUTION OF EAE-HOMOLOGUES 
AMOm yEROTOKm-PRODUCmG 
PROBE 





C1-C2 


SEROGROUP 


(LOkD) 


01 57 


67 /@7 


0111 


11/11 


026 


It / 12 


05 


7/ 7 


01 03 


4/ 4 


0145 


5/ 5 


01 18 


1/ 1 


0113 


0/ 5 


0156 


0/ 4 


0127:H6 

(E2348) 


1/ 1 


015:H- 
(RDEC1) 


1/1 



A3-B2 P1EH-P2EH 

(O-Skb) (0.45 kb) 

18/18 7/ 7 

2/10* 0/ 5 

0/ 7 0/ 2 

0/6 0/ 1 

0/ 3 0/ 1 
0/13 

0/ 1 0/ 2 

0/ 4 0/ 1 
0/ 4 

0/ 1 0/ 1 

0/ 1 0/ 1 



- 35 - 



TABLE 3 



2078716 



DETECTlO^i OF ©1S7-.HT E^WUi BY PCB 

ysiNQ sPEGiFic pumms 

PRIISEB SET 
^B2 P1EH-P2EH 



SEROGROUPS 




#Pos / #1 


0157 (VT4-) 


IS/li 


18/18 


01 57 (VT-) 


0/ a 


0/ 8 


0111 (VT+) 

(N!^,H8,H11) 


17/18 


0/12 


0111 (VT-) 
(H2,H12) 


0/ 2 


0/ 2 


026 


0/ 6 


0/ 2 


VTEC* 


0/27 


0/16 


ETEC** 


0/ s 


0 / 3 


EPEC*** 


©/ 4 


0/ 1 


ElEC 

(0136) 


©/ 1 


0/ 1 



* VTEC (01. 02, 05. 07. 016. 018. 022, 038, 040. 042. 091. O103, 0113, 
0117. 01 18, 0121, 0145. OR, 07) 

** ETEC (06, 0115, Ol*^, 0153) 

***EPEC (015, 0119, 0126, 0127) 
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SEQUENCE LISTING 

1. GENERAL INFORMATION 

(a) APPLICANTS Joyce de Azavedo, James Brunt on 
and Marie Louie 

(b) TITLE OF INVENTION: Attaching and Effacing 
Protein of Enterohemorrhagic E .coli 

(c) NUMBER OF SEQUENCES : 9 

2. CORRESPONDENCE ADDRESS; 

(a) ADDRESSEE: Linda M. Kurdydyk, 
Bereskin & Parr 

(b) STREET: Box 401, 40 King Street West 

(c) CITY: Toronto, Ontario 

(d) COUNTRY: Canada 

(e) ZIP CODE: M5H 3Y2 

3. COMPUTER READABLE FORM: 

(a) MEDIUM TYPE: Floppy disk 

(b) COMPUTER: IBM PC compatible 

(c) OPERATING SYSTEM: PC-DOS /MS-DOS 

(d) SOFTWARES DOS Text File 

4. CURRENT APPLICATION DATA: 

(a) APPLICATION NUMBER: Unknown 

(b) FILING DATE: Unknown 

(c) CLASSIFICATION: Unknown 

5. ATTORNEY /AGENT INFORMATION: 

(a) NAME: Linda M. Kurdydyk 

(b) REGISTRATION NUMBER: 34,971 

( c ) REFERENCE NUMBER /DOCKET NUMBER : 3153-057 /LMK 

6. TELECOMMUNICATION INFORMATION: 

(a) TELEPHONE : (416) 364-7311 

(b) TELECOPY: (416) 361-1398 

7. INFORMATION FOR SEQ ID N0:1: 

(a) SEQUENCE CHARACTERISTICS: 

(i) LENGTH: 3131 base pairs 

(ii) TYPE:nucleic acid 

(iii) STRANDEDNESS : double 

(iv) TOPOLOGY: linear 

(b) MOLECULAR TYPE: DNA (genomic) 

(c) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 



TCGAGAATGAAATAGAAGTCGTTGTTAAGTCAATGGAAAACCTGTATTTGGTATTACATA 

: — +- + -+ .+— + — — + 60 

AGCTCTTACTTTATCTTCAGCAACAATTCAGTTACCTTTTGGACATAAACCATAATGTAT 
SRMK*KSLLSQWKTCIWYYI 
RE*NRSRC*VNGKPVFGIT* - 
ENE lEVVVKSMENLYLVLHN- 
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ATCAGGGAATAACATTAGAAAACGAACATATGAAAATAGAGGAiyi,TCAGTTCAAGCGACA 
61 + ~ + __+__ + +_____ .+ 120 

TAGTCCCTTATTGTAATCTTTTGCTTGTATACTTTTATCTCCTTTAGTCAAGTTCGCTGT 
a IRE*H*KTNI*K*RKS VQAT 
b SGNNIRKRTYENRGNQFKRQ- 
c QGITLENEHMKIEEISSSDN- 

ATAAACATTATTACGCCGGAAGATAAAATCCGATCTATTAATATAATTTATTTCTCATTC 
121 + — — + -+ + — + .— 180 

TATTTGTAATAATGCGGCCTTCTATTTTAGGCTAGATAATTATATTAAATAAAGAGTAAG 
a INIITPEDKIRSINIIYFSF 
b *TLLRRKIKSDLLI*FISHS- 
C KHYYAGR*NPiy*YNLFLIL- 

TAACTCATTGTGGTGGAGCCATAACATGATTACTCATGGTTGTTATACCCGGACCCGGCA 

181 — — _+ + .+ + + — + 240 

ATTGAGTAACACCACCTCGGTATTGTACTAATGAGTACCAACAATATGGGCCTGGGCCGT 

a *LIVVEP*HDYSWLLYPDPA 

b NSLWWSHNMITHGCYTRTRH- 

c THCGGAIT*LLMVVIPGPGT- 



CAAGCATAAGCTAAAAAAAACATTGATTATGCTTAGTGCTGGTTTAGGATTGTTTTTTTA 
241 + + +— + + + 300 

GTTCGTATTCGATTTTTTTTGTAACTAATACGAATCACGACCAAATCCTAACAAAAAAAT 
a QA*AKKNIDYA*CWFRIVPL 
b KHKLKKTLIMLSAGLGLFFY- 
c SIS*KKH*LCLVLV*DCFFM- 

TGTTAATCAGAATTCATTTGCAAATGGTGAAAATTATTTTAAATTGGGTTCGGATTCAAA 
301 + + +- .-+ • 1- -+ 350 

ACAATTAGTCTTAAGTAAACGTTTACCACTTTTAATAAAATTTAACCCAAGCCTAAGTTT 
a C*SEFICKW*KLF*IGFGFK 
b VNQNSFANGENYPKLGSDSK- 
C LIRIHLQMVKI ILNWVRIQN- 

ACTGa?TAACTCATGATAGCTATCAGAATCGCCTTTTTTATACGTTGAAAACTGGTGAAAC 

361 . +___ + + . -+ + + 420 

TGACAATTGAGTACTATCGATAGTCTTAGCGGAAAAAATATGCAACTTTTGACCACTTTG 

a TVNS**LSESPFLYVENW*N 

b liLTHDSYQNRLFYTLKTGET- 

C C*LMIAIRIAFFIR*KLVKL- 
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TGTTGCCGATCTTTCTAAATCGCAAGATATTAATTTATCGACGATTTGGTCGTTGAATAA 
421 „____+ +-. + +— + + 480 

ACAACGGCTAGAAA6ATTTAGCGTTCTATAATTAAATAGCTGCTAAACCAGCAACTTATT 
a CCRSF*IARY*FIDDLVVE* 
b VADLSKSQDINLSTIWSLNK- 
c IiPIFLNRKILIYRRPGR*IS~ 

GCATTTATACAGTTCTGAAAGCGAAATGATGAAGGCCGCGCCTGGTCAGCAGATCATTTT 
481 + — + + . — + + + 540 

CGTAAATATGTCAAGACTTTCGCTTTACTACTTCCGGCGCGGACCAGTCGTCTAGTAAAA 
a AFIQF*KRNDEGRAWSADHF 
b HLYSSESEMMKAAPGQQIIL- 
c IYTVLKAK**RPRLVSRSFC- 

GCCACTCAAAAAACTTCCCTTTGAATACAGTGCACTACCACTTTTAGGTTCGGCACCTCT 
541 + — — + ____+ + + 600 

CGGTGAGTTTTTTGAAGGGAAACTTATGTCACGTGATGGTGAAAATCCAAGCCGTGGAGA 
a ATQKTSL*IQCTTTFRFGTS 
b PLKKLPFEYSALPLLGSAPL- 
c HSKNFPLNTVHYHF*VRHLL- 

TGTTGCTGCAGGTGGTGTTGCTGGTCACACGAATAAACTGACTAAAATGTCCCCGGACGT 
501 +— _. + +— — + + + 660 

ACAACGACGTCCACCACAACGACCAGTGTGCTTATTTGACTGATTTTACAGGGGCCTGCA 
a CCCRWCCWSHE*TD*NVPGR 
b VAAGGVAGHTNKLTKMSPDV- 
C LLQVVLLVTRIN*LKCPRT*- 

GACCAAAAGCAACATGACCGATGACAAGGCATTAAATTATGCGGCACAACAGGCGGCGAG 
661 — + + + + + + 720 

CTGGTTTTCGTTGTACTGGCTACTGTTCCGTAATTTAATACGCCGTGTTGTCCGCCGCTC 
a DQKQHDR*QGIKLCGTTGGE 
b TKSNMTDDKALNYAAQQAAS- 
c PKAT*PMTRH*IMRHNRRRV- 

TCTCGGTAGCCAGCTTCAGTCGCGATCTCTGAACGGCGATTACGCGAAAGATACCGCTCT 
721 +- + — + ■— + — — + + 780 

AGAGCCATCGGTCGAAGTCAGCGCTAGAGACTTGCCGCTAATGCGCTTTCTATGGCGAGA 
a SR*PASVAISERRLRERYRS 
b LGSQLQSRSLNGDYAKDTAL- 
C SVASFSRDL*TAITRKIP IiL- 
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TGGTATCGCTGGTAACCAGGCTTCGTCACAGTTGCAGGCCTGGTTACAACATTATGGAAC 
731 — — +- -. +— +— — + + 840 

ACCATAGCGACCATTGGTCCGAAGCAGTGTCAACGTCCGGACCAATGTTGTAATACCTTG 
a WYRW*PGFVTVAGLVTTLWN 
b GIAGNQASSQLQAWLQHYGT- 
c VSLVTRLRHSCRPGYNIMER- 

GGCAGAGGTTAATCTGCAGAGTGGTAATAACTTTGACGGTAGTTCACTGGACTTCTTATT 
841 + + + + + 900 

CCGTCTCCAATTAGACGTCTCACCATTATTGAAACTGCCATCAAGTGACCTGAAGAATAA 
a GRG*SAEW**L*R*FTGLLI 
b AEVNLQSGNNFDGSSLDFLL- 
C QRLICRVVITLTVVHWTSyy- 



ACCGTTCTATGATTCCGAAAAAATGCTGGCATTTGGTCAGGTCGGAGCGCGTTACATTGA 
901 . +______, + .____+___.^ __+ + 960 

TGGCAAGATACTAAGGCTTTTTTACGACCGTAAACCAGTCCAGCCTCGCGCAATGTAACT 
a TVL*FRKNAGIWSGRSALH* 
b PFYDSEKMLAPGQVGARYID- 
c RSMIPKKCWHLVRSERVTLT- 



CTCCCGCTTTACGGCAAATTTAGGTGCGGGTCAGCGTTTTTTCCTTCCTGCAAACATGTT 
961 — + — + + + + + 1020 

GAGGGCGAAATGCCGTTTAAATCCAGGCCCAGTCGCAAAAAAGGAAGGACGTTTGTACAA 
a LPLY GKFRCGS AFFPSCKHV 
b SRFTANLGAGQRFFLPANML- 
C PALRQI*VRVSVFSFLQTCW- 

GGGCTATAACGTCTTCATTGATCAGGATTTTTCTGGTGATAATACCCGTTTAGGTATTGG 
1021 +~ -+~ — + — + +- + 1080 

CCCGATATTGCAGAAGTAACTAGTCCTAAAAAGACCACTATTATGGGCAAATCCATAACC 
a GL*RLH*SGFFW**YPFRYW 
b GYNVPIDQDPSGDNTRLGIG- 
c AITSSLIRIFLVIIPV*VLV- 

TGGCGAATACTGGCGAGACTATTTCAAAAGTAGCGTTAACGGCTATTTCCGCATGAGCGG 
1081 + + + + — + + 1140 

ACCGCTTATGACCGCTCTGATAAAGTTTTCATCGCAATTGCCGATAAAGGCGTACTCGCC 
a WRILARLFQK*R*RLFPHER 
b GEYWRDYFKSSVNGYFRMSG- 
c A NTGETISKVALTAISA*AA- 
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CTGGCATGAGTCATACAATAAGAAAGACTATGATGAGCGCCCAGCAAATGGCTTCGATAT 
1141 + — -+ -+ + .— +~ .- + 1200 

GACCGTACTCAGTATGTTATTCTTTCTGATACTACTCGCGGGTCGTTTACCGAAGCTATA 
a LA*VIQ*ERL**APSKWLRY 
b WHESYNKKDYDERPANGFDI- 
C GMSHTIRKTMMSAQQMASIS- 



CCGTTTTAATGGCTATCTACCGTCATATCCGGCATTAGGCGCCAAGCTGATATATGAGCA 
1201 + + + +- — + + 1260 

GGCAAAATTACCGATAGATGGCAGTATAGGCCGTAATCCGCGGTTCGACTATATACTCGT 
a PF*WLSTVISGIRRQADI*A 
b RFNGYLPSYPALGAKLIYEQ- 
C VLMAIYRHIRH*APS*YMSS- 

GTATTATGGTGATAATGTTGCTTTGTTTAATTCTGATAAGCTGCAGTC6AATCCTGGTGC 
1261 + +_____-_—+——____+— + 1320 

CATAATACCACTATTACAACGAAACAAATTAAGACTATTCGACGTCAGCTTAGGACCACG 
a VLW**CCFV*F**AAVESWC 
b YYGDNVALFNSDKLQSNPGA- 
C IMVIMLLCLILISCSRILVR- 



GGCGACCGTTGGTGTAAACTATACTCCGATTCCTCTGGTGACGATGGGGATCGATTACCG 
1321 ■ + + -+— — + — +• + 1380 

CCGCTGGCAACCACATTTGATATGAGGCTAAGGAGACCACTGCTACCCCTAGCTAATGGC 
a GDRWCKLYSDSSGDDGDRLP 
b ATVGVNYTPIPLVTMG IDYR- 

c RPLV*TILRFLW*RWG SITV- 

TCATGGTACGGGTAATGAAAATGATCTCCTTTACTCAATGCAGTTCCGTTATCAGTTTGA 
1381 + + — 1- + + 1440 

AGTACCATGCCCATTACTTTTACTAGAGGAAATGAGTTACGTCAAGGCAATAGTCAAACT 
a SWYG**K*SPLLNAVPLSV* 
b HGTGNEND LLYSMQFRYQFD- 

c MVRVMKMISFTQCSSVISLI- 

TAAATCGTGGTCTCAGCAAATTGAACCACAGTATGTTAACGAGTTAAGAACATTATCAGG 
1441 + + .-- .+ . + + + 1500 

ATTTAGCACCAGAGTCGTTTAACTTGGTGTCATACAATTGCTCAATTCTTGTAATAGTCC 
a *ivVSAN*TTVC*RVKNIIR 
b KSWSQQIEPQYVNELRTLSG- 
c NRGLSKLNHSMLTS*EHyQA- 
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CAGCCGTTACGATCTGGTTCAGCGTAATAACAATATTATTCTGGAGTACAAGAAGCAGGA 
1501 +—. + .+ — + + + 1.560 

GTCGGCAATGCTAGACCAAGTCGCATTATTGTTATAATAAGACCTCATGTTCTTCGTCCT 
a QPLRSGSA**QYYSGVQEAG 
b SRYDLVQRNNNIILEYKKQD- 
C AVTIWFSVITILFWSTRSRI- 



TATTCTTTCTCTGAATATTCCGCATGATATTAATGGTACTGAACACAGTACGCAGAAGAT 
1561 + + + + + + 1620 

ATAAGAAAGAGACTTATAAGGCGTACTATAATTACCATGACTTGTGTCATGCGTCTTCTA 
a YS FSEYSA*Y*WY*TQYAED 
b ILSLNIPHDINGTEHSTQKI- 
C PFL*IFRMILMVLNTVRRRF- 



TCAGTTGATCGTTAAGAGCAAATACGGTCTGGATCGTATCGTCTGGGATGATAGTGGATT 
1621 + +— — I- + ■+ — • + 1680 

AGTCAACTAGCAATTCTCGTTTATGCCAGACCTAGCATAGCAGACCCTACTATCACGTAA 
a SVDR*EQIRSGSYRLG**CI 
b QLIVKSKYGLDRIVWDDSAL- 
C S*SLRANTVWIVSSGMIVHY- 



ACGCAGTCAGGGCGGTCAGATTCAGCATAGCGGAAGCCAAAGCGCACAAGACTACCAGGC 
1681 — — + + + + + + 1740 

TGCGTCAGTCCCGCCAGTCTAAGTCGTATCGCCTTCGGTTTCGCGTGTTCTGATGGTCCG 
a TQSGRSDSA*RKPKRTRLPG 
b RSQGGQIQHSGSQSAQDYQA- 
c AVRAVRFSIAEAKAHKTTRL- 



TATTTTGCCTGCTTATGTGCAAGGTGGCAGCAATATTTATAAAGTGACGGCTCGCGCCTA 
1741 -+-. -+ . + + — + + 1800 

ATAAAACGGACGAATACACGTTCCACCGTCGTTATAAATATTTCACTGCCGAGCGCGGAT 
a YFACLCARWQQYL*SDGSRL 
b ILPAYVQGGSNIYKVTARAY- 
C FCLLMCKVAAIFIK*RI.APM- 



TGACCGTAATGGCAATAGCTCTAACAATGTACAGCTTACTATTACCGTTCTGTCGAATGG 
1801 + + + •+ + + 1850 

ACTGGCATTACCGTTATCGAGATTGTTACATGTCGAATGATAATGGCAAGACAGCTTACC 
a *p*WQ*L*QCTAYYYRSVEW 
b DRNGNSSNNVQLTITVLSNG- 
c TVMAIALTMYSLLLPFCRMV- 
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TCAAGTTGTCGACCAGGTTGGGGTAACGGACTTTACGGCGGATAAGACTTCGGCTAAAGC 
1861 — — + + —_+»__ + — + 1920 

AGTTCAACAGCTGGTCCAACCCCATTGCCTGAAATGCCGCCTATTCTGAAGCCGATTTCG 
a SSCRPGWGNGLYGG*DFG*S 
b QVVDQVGVTDFTADKTSAK A- 

c KLSTRLG*RTLRRIRLRLKR- 



GGATAACGCCGATACCATTACTTATACCGCGACGGTGAAAAAGAATGGGGTAGCTCAGGC 
1921 + + + -+ + + 1980 

CCTATTGCGGCTATGGTAATGAATATGGCGCTGCCACTTTTTCTTACCCCATCGAGTCCG 
a G*R RYHYLYRDGEKEWGSSG 
b DNADTITYTATVKKNGVAQA- 
c ITPIPLLIPRR*KRMG*LRL- 



TAATGTCCCTGTTTCATTTAATATTGTTTCAGGAACTGCAACTCTTGGGGCAAATAGTGC 
1981 +- + +- + — ■ + "-+ 2040 

ATTACAGGGACAAAGTAAATTATAACAAAGTCCTTGACGTTGAGAACCCCGTTTATCACG 
a *CPCFI*YCFRNCNSWGK*C 
b NVPVSPNIVSGTATLGANSA- 
c MSLFHLILFQELQLLGQIVP- 



CAAAACGGATGCTAACGGTAAGGCAACCGTAACGTTGAAGTCGAGTACGCCAGGACAGGT 
2041 — + + + — + + + 2100 

GTTTTGCCTACGATTGCCATTCCGTTGGCATTGCAACTTCAGCTCATGCGGTCCTGTCCA 
a QNGC*R*GNRNVEVEYARTG 
b KTDANGKATVTLKSSTPGQV- 
C KRMLTVRQP*R*SRVRQDRS- 



CGTCGTGTCTGCTAAAACCGCGGAGATGACTTCAGCACTTAATGCCAGTGCGGTTATATT 
2101 — _+ +— . — + .— +— — + + 2160 

GCAGCACAGACGATTTTGGCGCCTCTACTGAAGTCGTGAATTACGGTCACGCCAATATAA 
a RRVC*NRGDDFST*CQCGYI 
b VVSAKTAEMTSALNASAVIF- 
C SCLLKPRR*LQHLMPVRLYF- 



TTTTGATCAAACCAAGGCCAGCATTACTGAGATTAAGGCTGATAAGACAACTGCAGTAGC 
2161 +____—__-+ _.___+___—__-+ — + — — + 2220 

AAAACTAGTTTGGTTCCGGTCGTAATGACTCTAATTCCGACTATTCTGTTGACGTCATCG 
a F*SNQGQHY*D*G**DNCSS 
b FDQTKASITEIKADKTTAVA- 
c LIKPRPALLRLRLIRQLQ*Q- 
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iiAATGGTAAGGATGCTATTAAATATACTGTAAAAGTTATGAAAAACGGTCAGCCAGTTAA 
2221 + + •+ + + + 2280 

TTTACCATTCCTACGATAATTTATATGACATTTTCAATACTTTTTGCCAGTCGGTCAATT 
a KW*GCY*IYCKSYEKRSAS* 
b NGKDAIKYTVKVMKNGQPVN- 
C MVRMLLNIL*KL*KTVSQLI- 



TAATCAATCCGTTACATTCTCAACAAACTTTGGGATGTTCAACGGTAAGTCTCAAACGCA 
2281 — + — + — — + +- — -+ 2340 

ATTAGTTAGGCAATGTAAGAGTTGTTTGAAACCCTACAAGTTGCCATTCAGAGTTTGCGT 
a *SIRYILNKLWDVQR*VSNA 
b NQSVTFSTNFGMFNGKSQTQ- 
c INPLHSQQTLGCSTVSLKRK- 



AGCAACCACGGGAAATGATGGTCGTGCGACGATAACACTAACTTCCAGTTCCGCCGGTAA 
2341 + + + • + + + 2400 

TCGTTGGTGCCCTTTACTACCAGCACGCTGCTATTGTGATTGAAGGTCAAGGCGGCCATT 
a SNHGK*WSCDDNTNPQFRR* 
b ATTGNDGRATITLTSSSAGK- 
c QPREMMVVRR*H *LPVPPVK- 

AGCGACTGTTAGTGCGACAGTCAGTGATGGGGCTGAGGTTAAAGCGACTGAGGTCACTTT 
2401 ■ — - — + + • +~ — — — -+- •+ + 2460 

TCGCTGACAATCACGCTGTCAGTCACTACCCCGACTCCAATTTCGCTGACTCCAGTGAAA 
a SDC*CDSQ*WG*G*SD*GHF 
b ATVSATVSDGAEVKATEVTF- 
C RLLVRQSVMGL RLKRLRSLF- 

TTTTGATGAACTGAAAATTGACAACAAGGTTGATATTATTGGTAACAATGTCAAGAGGTC 
2461 + + — • ■ + — — + . — + + 2520 

a F**TEN*QQG*YYW*QCQEV 

b FDELKIDNKVDIIGNNVKRS- 

c LMN*KLTTRLILLVTMSRGR- 

GATGTTGCCTAATATTTGGCTGCAATATGGTCAGTTTAAACTGAAAGCAAGCGGTGGTGA 
2521 +— + — + ■ -+ + + 2580 

CTACAACGGATTATAAACCGACGTTATACCAGTCAAATTTGACTTTCGTTC6CCACCACT 
a DVA*YLAAIWSV*TESKRW* 
b MLPNIWLQYGQFKLKASGGD- 
c CCLIFGCNMVSLN*KQAVVM- 
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'iGGTACATATTCATGGTATTCAGAAAATACCAGTATCGCGACTGTCGATGCATCAGGGAA 
2581 ■-+ + + +-• + + 2640 

ACCATGTATAAGTACCATAAGTCTTTTATGGTCATAGCGCTGACAGCTACGTAGTCCCTT 
a WYIFMVFRKYQYRDCRCIRE 
b GTYSWYSENTSIATVDASGK- 
c VHIHGIQKIPVSRLSMHQGK- 

AGTCACTTTGAATGGTAAAGGCAGTGTCGTAATTAAAGCCACATCTGGTGATAAGCAAAC 
2641 + + + + +- + 2700 

TCAGTGAAACTTACCATTTCCGTCACAGCATTAATTTCGGTGTAGACCACTATTCGTTTG 
a SHFEW*RQCRN*SHIW**AN 
b VTLNGKGSVVIKATSGDKQT- 
c SL*MVKAVS*LKPHLVISKQ- 



AGTAAGTTACACTATAAAAGCACCGTCGTATATGATAAAAGTGGATAAGCAAGCCTATTA 
2701 — — + +— + + ■ +— — + 2760 

TCATTCAATGTGATATTTTCGTGGCAGCATATACTATTTTCACCTATTCGTTCGGATAAT 
a SKLHYKSTVVYDKSG*ASLL 
b VSYTIKAPSYMIKVDKQAYY- 
C *VTL*KHRRI * *KWISKPIM- 

TGCTGATGCTATGTCCATTTGCAAAAATTTATTACCATCCACACAGACGGTATTGTCAGA 
2761 + + • + + + + 2820 

ACGACTACGATACAGGTAAACGTTTTTAAATAATGGTAGGTGTGTCTGCCATAACAGTCT 
a C*CYVHLQKFITIHTDGIVR 
b ADAMSICKNLLPSTQTVLSD- 
c LMLCPFAKIYYHPHRRYCQI- 

TATTTATGACTCATGGGGGGCTGCAAATAAATATAGCCATTATAGTTCTATGAACTCAAT 
2821 + + — + + + + 2880 

ATAAATACTGAGTACCCCCCGACGTTTATTTATATCGGTAATATCAAGATACTTGAGTTA 
a YL*LMGGCK*I*PL*FYELN 
b lYDSWGAANKYSHYSSMNSI- 
C FMTHGGLQINIAIIVL*TQ*- 

AACTGCTTGGATTAAACAGACATCTAGTGAGCAGCGTTCTGGAGTATCAAGCACTTATAA 
2881 + + — — + ■+ +~ + 2940 

TTGACGAACCTAATTTGTCTGTAGATCACTCGTCGCAAGACCTCATAGTTCGTGAATATT 
a NCLD*TDI**AAFWSIKHL* 
b TAWIKQTSSEQRSGVSSTYN- 
C LLGLNRHLVSSVLEYQALIT- 
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LCTAATAACACAAAACCCTCTTCCTGGGGTTAATGTTAATACTCCAAATGTCTATGCGGT 
2941 + — • + + + + — — — + 3000 

GGATTATTGTGTTTTGGGAGAAGGACCCCAATTACAATTATGAGGTTTACAGATACGCCA 
a PNNTKPSSWG*C*YSKCLCG 
b LITQNPLPGVNVNTPNVYAV- 
c * *HKTLFLGLMLILQMSMRF- 



TTGTGTAGAATAATTCCATAACCACCCCGGCTAAAATATGTATTGTTTTAGTCGGGGCAT 
3001 + + -+ + + + 3050 

AACACATCTTATTAAGGTATTGGTGGGGCCGATTTTATACATAACAAAATCAGCCCCGTA 
a LCRIIP*PPRLKYVLF*SGH 
b CVE*FHNHPG*NMYCFSRGI- 
C V*NNSITTPAKICIVLVGA*- 



AATTATTTCTTCTTAAGAAATAACCCTCTTATAATCAAATCTACTACTGGTCTTTTTATC 
3061 + — + -+ +-• + — — + 3120 

TTAATAAAGAAGAATTCTTTATTGGGA6AATATTAGTTTAGATGATGACCAGAAAAATAG 
a NYFFLRNNPLIIKSTTGLFI 
b IISS*EITLL*SNLLLVFLS- 
c LFLLKK*PSYKQIYYWSFYL- 



TGCTTAATAGG 

3121 +- 3131 

ACGAATTATCC 

a C L I 

b A * * 

c L N R - 

8. INFORMATION FOR SEQ ID NO: 2; 

(a) SEQUENCE CHARACTERISTICS: 

(i) LENGTHS 31 base pairs 

(ii) TYPE: nucleic acid 

(iii) STRANDEDNESS ! single 

(iv) TOPOLOGY: linear 

(b) MOLECULAR TYPE: DNA (genomic) 

(c) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 



5'ATG GAA TTC TCG TCA CAG TTG CAG GCC TGG T 3 



INFORMATION FOR SEQ ID NOsSs 

(a) SEQUENCE CHARACTERISTICS! 

(i) LENGTH: 32 base pairs 

(ii) TYPE: nucleic acid 

(iii) STRANDEDNESS : single 

( iv ) TOPOLOGY : 1 inear 

(b) MOLECULAR TYPES DNA (genomic) 

(c) SEQUENCE DESCRIPTION! SEQ ID NO! 3: 



5 ' ATG GAA TTC CGA AGT CTT ATC AGC CGT AAA GT 3 ' 



INFORMATION FOR SEQ ID NO! 4: 

(a) SEQUENCE CHARACTERISTICS I 

(i) LENGTHS 29 base pairs 

(ii) TYPE! nucleic acid 

(iii) STRANDEDNESS: single 

(iv) TOPOLOGY! linear 

(b) MOLECULAR TYPE; DNA (genomic) 

(c) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



5 'ATG GAA TTC GGA TGT TCA AC6 GTA AGT CT 3' 



INFORMATION FOR SEQ ID NO: 5: 

(a) SEQUENCE CHARACTERISTICS: 

(i) LENGTH: 29 base pairs 

(ii) TYPE: nucleic acid 

(iii) STRANDEDNESS: single 

(iv) TOPOLOGY! linear 

(b) MOLECULAR TYPEs DNA (genomic) 

(c) SEQUENCE DESCRIPTION! SEQ ID NOsS: 



5 ' ATG GAA TTC ACA ATA CCG TCT GTG TGG AT 3 ' 
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12. xNFORMATION FOR SEQ ID NO: 6s 

(a) SEQUENCE CHARACTERISTICS: 

(i) LENGTH: 17 base pairs 

(ii) TYPE: nucleic acid 

(iii) STRANDEDNESS: single 

( iv ) TOPOLOGY : 1 inear 

(b) MOLECULAR TYPE: DNA (genomic) 

(c) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

5'AAG CGA CTG AGG TCA CT 3' 

13. INFORMATION FOR SEQ ID N0:7s 

(a) SEQUENCE CHARACTERISTICS: 

(i) LENGTH: 18 base pairs 

(ii) TYPE: nucleic acid 

(iii) STRANDEDNESS: single 

(iv) TOPOLOGY: linear 

(b) MOLECULAR TYPE: DNA (genomic) 

(c) SEQUENCE DESCRIPTION! SEQ ID NO: 7: 

5'ACG CTG CTC ACT AGA TGT 3' 

14. INFORMATION FOR SEQ ID NO: 8: 

(a) SEQUENCE CHARACTERISTICS: 

(i) LENGTH: 17 base pairs 

(ii) TYPE: nucleic acid 

(iii) STRANDEDNESS J single 

( iv ) TOPOLOGY : 1 inear 

(b) MOLECULAR TYPE: DNA (genomic) 

(c) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
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5 ' AGA CCT AGG TTA CAT TT 3 ' 



iNFORMATION FOR SEQ ID NO: 9s 

(a) SEQUENCE CHARACTERISTICS s 

(i) LENGTH: 18 base pairs 

(ii) TYPE: nucleic acid 
(ill) STRANDEDNESS ! single 
(iv) TOPOLOGY: linear 

(b) MOLECULAR TYPE: DNA (genomic) 

(c) SEQUENCE DESCRIPTION: SEQ ID HO: 8: 
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5 'TAT TTT ATC AGC TTC AGT 3 
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xriE EMBODIMENTS OF THE INVEHTION IN WHICH AN EXCLUSIVE PROPERTY 
OR PRIVILEGE IS CLAIMED ARE DEFINED AS FOLLOWS! 

1 . A purified and isolated DNA segment having a sequence which 
codes for a protein associated with attaching and effacing 
activity of enterohemorrhagic E.coli . or an oligonucleotide 
fragment of the DNA segment. 

2. A purified and isolated DNA segment as claimed in claim 1 
having a sequence which codes for a protein associated with 
attaching and effacing activity of enterohemorrhagic E .coli which 
has substantial homology with the amino acid sequence as shown 
in the Sequence Listing as SEQ ID N0:1. 

3. A purified and isolated DNA segment as claimed in claim 1 
having a sequence having substantial sequence homology with the 
nucleotide sequence as shown in the Sequence Listing as SEQ ID 
NOs 1. 

4. A purified and isolated DNA segment as claimed in claim 1 
having a sequence having more than ? sequence homology with the 
nucleotide sequence as shown in the Sequence Listing as SEQ ID 
NO!l. 

5. A purified and isolated single stranded nucleotide sequence 
comprising a DNA segment as claimed in claim 2. 

5. An antisense sequence of a DNA segment as claimed in claim 
1 or an oligonucleotide fragment of the antisense sequence. 

7 , A recombinant molecule adapted for transformation of a host 
cell comprising a DNA segment or an oligonucleotide fragment of 
the DNA segment as claimed in claim 1 and an expression control 
sequence operatively linked to the DNA segment. 

8. A transforrnant host cell including a recombinant molecule 
as claimed in claim 7 . 
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s . A method for preparing a protein associated with attaching 
and effacing activity of enterohemorrhagic E.eoli utilizing a 
purified and isolated DNA segment as claimed in claim 2. 

10. A substantially pure protein associated with attaching and 
effacing activity of enterohemorrhagic E.eoli . 

11. A substantially pure protein as claimed in claim 10 having 
an amino acid sequence which has substantial homology with the 
amino acid sequence as shown in the Sequence Listing as SEQ ID 
N0:1. 

12. A protein encoded by the purified and isolated DNA segment 
as claimed in claim 2. 

13. A monoclonal or polyclonal antibody specific for an epitope 
of a protein as claimed in claim 10. 

14. An antibody as claimed in claim 13 which binds a distinct 
epitope in an unconserved region of the protein. 

15. A purified and isolated nucleotide sequence comprising the 
sequence shown in the Sequence Listing as SEQ ID NO: 2 or the 
sequence shown in the Sequence Listing as SEQ ID NOsS, or a 
sequence having substantial homology thereto or a fragment of the 
nucleotide sequence. 

16. A purified and isolated nucleotide sequence comprising the 
sequence shown in the Sequence Listing as SEQ ID N0s4 or the 
sequence shown in the Sequence Listing as SEQ ID NOsS, or a 
sequence having substantial homology thereto or a fragment of the 
nucleotide sequence. 

17 . A purified and isolated nucleotide sequence comprising the 
sequence shown in the Sequence Listing as SEQ ID NOsS or the 
sequence shown in the Sequence Listing as SEQ ID NO: 7, or a 
sequence having substantial homology thereto or a fragment of the 
nucleotide sequence. 
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±0. A purified and isolated nucleotide sequence comprising the 
sequence shown in the Sequence Listing as SEQ ID NOsS or the 
sequence shown in the Sequence Listing as SEQ ID NO: 9, or a 
sequence having substantial homology thereto or a fragment of the 
nucleotide sequence. 

19. A diagnostic kit for detecting enterohemorrhagic E .coli 0157 
in a sample comprising a monoclonal antibody as claimed in claim 
13 and directions for its use. 

20. An immunoassay for the detection of E .coli 0157 comprising 
contacting a sample suspected of containing E.coli 0157 with a 
monoclonal antibody as claimed in claim 13 to form an immune 
complex; and determining the presence of the immune complex in 
order to detect E.coli 0157 in the sample. 

21. An avirulent strain of an enterohemorrhagic E.i.cpli 
comprising an avirulent bacterial carrier strain transformed with 
a recombinant molecule as claimed in claim 7 . 

22. A vaccine composition comprising a bacterial carrier strain 
transformed with a recombinant molecule as claimed in claim 7. 

23. A method for detecting the presence of a DNA segment having 
a sequence encoding a protein related to or analogous to a 
protein associated with attaching and effacing activity of 
enterohemorrhagic E.coli or an oligonucleotide fragment thereof 
in a sample, comprising contacting the sample with a nucleotide 
probe capable of hybridizing with the DNA segment or an 
oligonucleotide fragment thereof, to form a hybridization 
product, under conditions which permit the formation of the 
hybridization product, and assaying for the hybridization 
product . 

24. A kit for detecting the presence of a DNA segment having a 
sequence encoding a protein related to or analogous to a protein 
associated with attaching and effacing activity of 
enterohemorrhagic E.coli or an oligonucleotide fragment thereof. 
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in a sample comprising a nucleotide probe capable of hybridizing 
with the DNA segment or an oligonucleotide fragment thereof, 
reagents required for hybridization of the nucleotide probe with 
the DNA segment or an oligonucleotide fragment thereof, and 
directions for its use. 

25. A method of determining the presence of a DNA segment having 
a sequence encoding a protein associated with attaching and 
effacing activity of enterohemorrhagic E .coli or a predetermined 
oligonucleotide fragment thereof in a sample, comprising treating 
the sample with primers which are capable of amplifying the DNA 
segment or the predetermined oligonucleotide fragment thereof in 
a polymerase chain reaction to form amplified sequences, under 
conditions which permit the formation of amplified sequences , and 
assaying for amplified sequences. 

26. A kit for determining the presence of a DNA segment having 
a sequence encoding a protein associated with attaching and 
effacing activity of enterohemorrhagic E.coll or a predetermined 
oligonucleotide fragment thereof in a sample, comprising primers 
which are capable of amplifying the DNA segment or the 
predetermined oligonucleotide fragment thereof in a polymerase 
chain reaction to form amplified sequences, means for assaying 
the amplified sequences, and directions for its use. 

27. A method for detecting verotoxin-producing E.coli of the 
serogroups 05, 026, 0103, 0111, 0118, 0145, and 0157 in a sample 
comprising contacting the sample with a Sall-StuI fragment (Cl- 
C2) comprising the sequence as shown in the Sequence Listing as 
SEQ ID NO: 2 and the sequence as shown in the Sequence Listing as 
SEQ ID NO: 3, under conditions which permit the fragment to 
hybridize with a complementary sequence in the sample to form a 
hybridization product, and assaying for the hybridization 
product . 

28. A method for detecting verotoxin-producing E.coli serogroup 
0157 in a sample comprising contacting the sample with a 0.5 kb 
fragment (A3-B2) comprising the sequence as shown in the Sequence 
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liisting as SEQ ID N0s4 and the sequence as shown in the Sequence 
Listing as SEQ ID NOsS, under conditions which permit the 
fragment to hybridize with a complementary sequence in the sample 
to form a hybridization product, and assaying for the 
hybridization product. 



29 . A method for detecting pathogenic verotoxin-produclng E.coli 
strains of the serogroup 0157 in a sample comprising treating the 
sample with the nucleotide sequence PIEH shown in the Sequence 
Listing as SEQ ID NO: 4 and the nucleotide sequence P2EH5 shown 
In the Sequence Listing as SEQ ID NOsS, in the polymerase chain 
reaction, under conditions which permit the formation of 
amplified sequences which terminate with the nucleic acid 
sequence of one of the primers and the complementary nucleic acid 
sequence of the other of the primers; and, assaying for the 
amplified sequences. 

30. A kit for detecting verotoxin-produclng E.coli of the 
serogroups 05, 026, 0103, 0111, 0118, 0145, and 0157 in a sample 
comprising a Sail-StuI fragment (C1-C2) comprising a sequence as 
shown in the Sequence Listing as SEQ ID NO: 2 and the sequence as 
shown the Sequence Listing as SEQ ID NOsS, reagents required for 
hybridization of the fragment with a DNA segment having a 
sequence encoding the attaching and effacing protein of 
enterohemorrhagic E.coli or an oligonucleotide fragment thereof, 
and directions for its use. 

31. A kit for detecting verotoxin-produclng E.coli of the 
serogroup 0157 in a sample comprising a 0.5 kb fragment (A3-B2) 
comprising a sequence as shown in the Sequence Listing as SEQ ID 
NO: 3 and the sequence as shown in the Sequence Listing as SEQ ID 
N0s4, reagents required for hybridization of the fragment with 
a DNA segment encoding the attaching and effacing protein of 
enterohemorrhagic E.coli or an oligonucleotide fragment thereof, 
and directions for its use. 



32. A kit for detecting pathogenic verotoxin-produclng E.coli 
strains of the serogroup 0157 in a sample comprising the primers 
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±-iEH as shown in the Sequence Listing as SEQ ID N0s5 and P2EH as 
shown in the Sequence Listing as SEQ ID N0s5, reagents required 
for the amplification in a PGR reaction of sequences which 
terminate with the nucleic acid sequence of one of the primers 
and the complementary nucleic acid sequence of the other of the 
primers, means for assaying amplified sequences, and directions 
for its use. 
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